Performance Evaluation of Parallel Sortings on the Supercomputer Fugaku
نویسندگان
چکیده
Sorting is one of the most basic algorithms, and developing highly parallel sorting programs becoming increasingly important in high-performance computing because number CPU cores per node modern supercomputers tends to increase. In this study, we have implemented two multi-threaded algorithms based on samplesort compared their performance supercomputer Fugaku. The first algorithm divides an input sequence into multiple blocks, sorts each block, then selects pivots by sampling from block at regular intervals. Each partitioned using pivots, partitions different blocks are merged a single sorted sequence. second differs only selecting where binary search used select such that elements partition equal. We compare with sequential multiway merging algorithms. demonstrate BlockQuicksort (a quicksort accelerated reducing conditional branches) for selection tree shows consistently high speed efficiency various data types sizes.
منابع مشابه
the effects of planning on accuracy and complexity of iranian efl students’ written narrative task performance
this study compared the different effects of form-focused guided planning vs. meaning-focused guided planning on iranian pre-intermediate students’ task performance. the study lasted for three weeks and concentrated on eight english structures. forty five pre-intermediate iranian students were randomly assigned to three groups of guided planning focus-on-form group (gpfg), guided planning focus...
15 صفحه اولthe effects of time planning and task complexity on accuracy of narrative task performance
هدف اصلی این تحقیق بررسی تاثیر برنامه ریزی زمانی، هم چنین افزایش میزان پیچیدگی تکالیف در نظر گرفته شده بصورت همزمان، بر دقت و صحت و پیچیدگی عملکرد نوشتاری زبان آموزان می باشد. بدین منظور، 50 نفر از دانش آموزان دختر در رده ی سنی 16 الی 18 سال به عنوان شرکت کنندگان در این زمینه ی تحقیق در نظر گرفته شدند و به دو گروه آزمایشی و کنترل بصورت اتفاقی تقسیم شدند. اعضای گروه آزمایشی هر دو تکلیف ساده و پی...
Improving the Performance of Parallel Backprojection on a Reconfigurable Supercomputer
Backprojection is an image reconstruction algorithm that is used in a number of applications, including synthetic aperture radar (SAR). For radar processing applications, backprojection provides a two-step method for reconstructing an image from the radar data that are collected. First, the radar traces are filtered according to a linear time-invariant system. These filtered traces make up the ...
متن کاملParallel Performance Evaluation of Sequence Nucleotide Alignment on the Supercomputer BlueGene/P
Bioinformatics is a scientific area requiring powerful computing resources for exploring large sets of biological data. Sequence alignment is an important method in DNA and protein analysis. BLAST has become the most popular tool and implements a fast heuristic method for sequence alignment and searching. The goal of this paper is to estimate the scalability of parallel sequence alignment on th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of information processing
سال: 2023
ISSN: ['0387-6101']
DOI: https://doi.org/10.2197/ipsjjip.31.452